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1 GPGPU: general purpose computation on graphics hardware Q 
^ David Luebke, Mark Harris, Jens Kruger, Tim Purcell, Naga Govindaraju, Ian Buck, Cliff 

^ Woolley, Aaron Lefohn 

August 2004 Proceedings of the conference on SIGGRAPH 2004 course notes GRAPH 
•04 

Publisher: ACM Press 

Full text available: ^ pdf(63.03 MB) Additional Information: full citation , abstract 

The graphics processor (GPU) on today's commodity video cards has evolved into an 
extremely powerful and flexible processor. The latest graphics architectures provide 
tremendous memory bandwidth and computational horsepower, with fully programmable 
vertex and pixel processing units that support vector operations up to full IEEE floating 
point precision. High level languages have emerged for graphics hardware, making this 
computational power accessible. Architecturally, GPUs are highly parallel s ... 

2 Opening the neural network black box: an algorithm for extracting rules from function Q 
approximating artificial neural networks 

Rudy Setiono, Wee Kheng Leow, James Y. L. Thong 

December 2000 Proceedings of the twenty first international conference on 

Information systems 
Publisher: Association for Information Systems 

Full text available: ^ pdf(206.40 KB) Additional Information: full citation , references , index terms 



Keywords: decision rules, knowledge acquisition, neural networks 



3 Poster papers: Extracting decision trees from trained neural networks Q 
^ Olcay Boz 

July 2002 Proceedings of the eighth ACM SIGKDD international conference on 

Knowledge discovery and data mining 
Publisher: ACM Press 

Full text available: ^ pdf(683.99 KB) Additional Information: full citation , abstract , references , index terms 

Neural Networks are successful in acquiring hidden knowledge in datasets. Their biggest 
weakness is that the knowledge they acquire is represented in a form not understandable 
to humans. Researchers tried to address this problem by extracting rules from trained 
Neural Networks. Most of the proposed rule extraction methods required specialized type 
of Neural Networks; some required binary inputs and some were computationally 
expensive. Craven proposed extracting MofN type Decision Trees from Neur ... 
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4 Generalized additive neural networks Q 
William J. E. Potts 

August 1999 Proceedings of the fifth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Publisher: ACM Press 

Full text available: ^ pdf(628.73 KB) Additional Information: full citation, references , index terms 



Keywords: additive models, partial residuals, predictive modeling 



5 Technical Correspondence: A neural net compiler system for hierarchical Q 
^ organization 
■ Rajeev Kumar 
February 2001 ACM SIGPLAN Notices, volume 36 issue 2 

Publisher: ACM Press 

Full text available: g pdfl954.76 KB) Additional Information: full citation , abstract , references 

We present a language framework for handling arbitrarily complex neural computations. 
The software architecture - which we call an Artificial /Veural Network Compiler for 
Hierarchical Organization (ANCHOR) - facilitates network hierarchy and simpler sub- 
mappings. We define a Net Definition Language (NDL) which is implemented in object- 
oriented programming paradigm; a trained network is decompiled bac ... 

Keywords: compiler-decompiler, hierarchical networks, neural net definitions, neural 
programming language, superneuron 



6 Neural network approach to solving the Traveling Salesman Problem Q 
Ralph Reilly, Plamen Tchimev 

October 2003 Journal of Computing Sciences in Colleges, volume 19 issue l 
Publisher: Consortium for Computing Sciences in Colleges 

Full text available: ^ pdf(233.89 KB) Additional Information: full citation , abstract , references , index terms 

The Traveling Salesman Problem involves mapping a route for a salesman to visit each 
city, without stopping in the same city twice, in the shortest route possible. A map of 
Germany is used as the test data for the study in this report. 

7 Selective Rademacher Penalization and Reduced Error Pruning of Decision Trees Q 
Matti Kaariainen, Tuomo Malinen, Tapio Elomaa 

December 2004 The Journal of Machine Learning Research, Volume 5 
Publisher: MIT Press 

Full text available: g pdf(191.42 KB) Additional Information: full citation , abstract , references 

Rademacher penalization is a modern technique for obtaining data-dependent bounds on 
the generalization error of classifiers. It appears to be limited to relatively simple 
hypothesis classes because of computational complexity issues. In this paper we, 
nevertheless, apply Rademacher penalization to the in practice important hypothesis class 
of unrestricted decision trees by considering the prunings of a given decision tree rather 
than the tree growing phase. This study constitutes the first appl ... 

8 Neural networks and dynamic complex systems Q 
Geoffrey Fox, Wojtek Furmanski, Alex Ho, Jeff Koller, Peter Simlc, Isaac Wong 

March 1989 Proceedings of the 22nd annual symposium on Simulation ANSS '89 
Publisher: IEEE Computer Society Press 

Full text available: ^j| pdfl1.44 MB1 Additional Information: full citation , abstract, references , index terms 

We describe the use of neural networks for optimization and inference associated with a 
variety of complex systems. We show how a string formalism can be used for parallel 
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computer decomposition, message routing and sequential optimizing compilers. We extend 
these ideas to a general treatment of spatial assessment and distributed artificial 
intelligence. 

9 Neural methods for dynamic branch prediction Q 
Daniel A. Jimenez, Calvin Lin 

November 2002 ACM Transactions on Computer Systems (TOCS), volume 20 issue 4 
Publisher: ACM Press 

Full text available: pdf(540.67 KB) Additional Information: full citation , abstract, references , index terms 

This article presents a new and highly accurate method for branch prediction. The key idea 
is to use one of the simplest possible neural methods, the perceptron, as an alternative to 
the commonly used two-bit counters. The source of our predictor's accuracy is its ability to 
use long history lengths, because the hardware resources for our method scale linearly, 
rather than exponentially, with the history length. We describe two versions of perceptron 
predictors, and we evaluate these predictors ... 

Keywords: Branch prediction, neural networks 




10 Identification of parallelism in neural networks by simulation with language J. 
Alexei N. Skurihin, Alvin J. Surkan 

September 1993 ACM SIGAPL APL Quote Quad , Proceedings of the international 

conference on APL APL '93, Volume 24 Issue 1 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings, index 



Full text available: V™-^ 

t£a ^ terms 

Neural networks, trained by backpropagation, are designed and described in the language 
J, an APL derivative with powerful function encapsulation features. Both the languages J 
[4,6,7] and APL [5] help to identify and isolate the parallelism that is inherent in network 
training algorithms. Non-critical details of data input and derived output processes are de- 
emphasized by relegating those functions to callable stand-alone modules. Such input and 
output ... 

11 Using artificial neural nets to predict academic performance 
Al Cripps 

February 1996 Proceedings of the 1996 ACM symposium on Applied Computing 
Publisher: ACM Press 

Full text available: I p pdf(543.55 KB) Additional Information: full citation , references , index terms 



Keywords: academic, neural networks, prediction 



12 Classification and regression: money *can* grow on trees 
Johannes Gehrke, Wie-Yin Loh, Raghu Ramakrishnan 

August 1999 Tutorial notes of the fifth ACM SIGKDD international conference on 

Knowledge discovery and data mining 
Publisher: ACM Press 

. IU s ■, ui « J</onc MD v Additional Information: full citation , abstract , references , citings , index 

Full text available: tra pdf(2.95 MB) 

^ terms 

With over 800 million pages covering most areas of human endeavor, the World-wide Web 
is a fertile ground for data mining research to make a difference to the effectiveness of 
information search. Today, Web surfers access the Web through two dominant interfaces 
clicking on hyperlinks and searching via keyword queries This process is often tentative 
and unsatisfactory Better support is needed for expressing one's information need and 
dealing with a search result in more structured ways than ... 
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13 Neural network for partitionable variational inequalities Q 
Giulia Rotondo 

July 1998 ACM SIGAPL APL Quote Quad , Proceedings of the APL98 conference on 

Array processing language APL '98, Volume 29 issue 3 
Publisher: ACM Press 

Full text available: ^ pdf(374.98 KB) Additional Information: full citation , abstract , index terms 

Often in recent times industries have asked mathematicians to determine their "true" 
utility functions directly from the available data about used resources and about the 
corresponding profits in order to optimize the latter with respect to the former. The 
possibility of determining the utility function directly from the data is very important 
because in this way the exact situation of the company is described. Moreover, the biggest 
companies divide their investments into several activities. The ... 

14 Book review: Fundamentals of Artificial Neural Networks by Mohamad H. Hassoun Q 
M (MIT PRESS. 1996) 

Emilio Sofia Olivas 

December 1997 ACM SIGART Bulletin, Volume 8 Issue 1-4 
Publisher: ACM Press 

Full text available: 'g| pdf(420.56 KB) Additional Information: full citation , abstract , references 

The field of artificial neural networks has grown substantially in recent years accompanied 
by an increased number of neural networks textbooks. These books attempt to give a 
broad introduction to both the theory and use of neural networks, such as (Hertz, 1991), 
(Kung, 1993), (Haykin, 1994), and (Rojas, 1996). Hassoun's book, like these, is a very 
good textbook for first year undergraduate students who are learning the computational 
abilities of neural networks. In addition it has some value to ... 

15 A hybrid of neural net and branch and bound techniques for seismic horizon tracking Q 
^g^, Jay Veezhinathan, Frank Kemp, Jody Threet 

^ March 1993 Proceedings of the 1993 ACM/SIGAPP symposium on Applied computing: 
states of the art and practice 
Publisher: ACM Press 

Full text available: *g| pdf(468.21 KB) Additional Information: full citation , references , citings , index terms 




16 Sparse bayesian learning and the relevance vector machine 
Michael E. Tipping 

September 2001 The Journal of Machine Learning Research, Volume l 
Publisher: MIT Press 

Full text available: pdf(999.88 KB) Additional Information: full citation , abstract , references, citings 

This paper introduces a general Bayesian framework for obtaining sparse solutions to 
regression and classification tasks utilising models linear in the parameters. Although this 
framework is fully general, we illustrate our approach with a particular specialisation that 
we denote the 'relevance vector machine' (RVM), a model of identical functional form to 
the popular and state-of-the-art 'support vector machine' (SVM). We demonstrate that by 
exploiting a probabilistic Bayesian learning framewor ... 

17 Morphology & tagging: Part-of-speech tagging with neural networks Q 
Helmut Schmid 

August 1994 Proceedings of the 15th conference on Computational linguistics - 
Volume 1 

Publisher: Association for Computational Linguistics 

Full text available: ^j) pdf(396.49 KB) Additional Information: full citation , abstract , references , citings 

Text corpora which are tagged with part-of-speech information are useful in many areas of 
linguistic research. In this paper, a new part-of-speech tagging method based on neural 
networks (Net-Tagger) is presented and its performance is compared to that of a HMM- 
tagger (Cutting et al., 1992) and a trigram-based tagger (Kempe, 1993). It is shown that 
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the Net-Tagger performs as well as the trig ram -based tagger and better than the HMM- 
tagger. 

18 Tree induction vs. logistic regression: a learning-curve analysis Q 
Claudia Perlich, Foster Provost, Jeffrey S. Simonoff 

December 2003 The Journal of Machine Learning Research, Volume 4 

Publisher: MIT Press 

Full text available:1 paDdf(263.37KB) Additional lnformation: Mdtatjon, abstract, references , citings, index 

terms 

Tree induction and logistic regression are two standard, off-the-shelf methods for building 
models for classification. We present a large-scale experimental comparison of logistic 
regression and tree induction, assessing classification accuracy and the quality of rankings 
based on class-membership probabilities. We use a learning-curve analysis to examine the 
relationship of these measures to the size of the training set. The results of the study show 
several things. (1) Contrary to some prior o ... 

19 Randomized Variable Elimination Q 
David 3. Stracuzzi, Paul E. Utgoff 

December 2004 The Journal of Machine Learning Research, Volume 5 
Publisher: MIT Press 

Full text available: ^ pdf(273.39 KB) Additional Information: full citation , abstract , references 

Variable selection, the process of identifying input variables that are relevant to a 
particular learning problem, has received much attention in the learning community. 
Methods that employ a learning algorithm as a part of the selection process (wrappers) 
have been shown to outperform methods that select variables independently from the 
learning algorithm (filters), but only at great computational expense. We present a 
randomized wrapper algorithm whose computational requirements are within a c ... 

20 Book review: Neural Networks for Pattern Recognition By Christopher M. Bishop Q 

(Clarendon Press. 1995) 
H. Van Dyke Parunak 
June 1998 ACM SIGART Bulletin, Volume 9 issue l 

Publisher: ACM Press 

Full text available: ^ pdf(385.24 KB) Additional Information: full citation , abstract 

Technical textbooks tend to be encyclopedic omnibus volumes, since the market rewards 
authors whose books can be used by teachers in many different classes. Pedagogically, 
this tendency is unfortunate, since it tends to homogenize the unique approaches and 
presentations that make great teachers truly effective. Bishop's textbook on neural nets is 
a welcome exception to the rule. It specifically limits its focus to feed-forward nets with 
explicit training. Much important work has been done in dom ... 
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21 Tree-Based Batch Mode Reinforcement Learning Q 
Damien Ernst, Pierre Geurts, Louis Wehenkel 

September 2005 The Journal of Machine Learning Research, volume 6 
Publisher: MIT Press 

Full text available: ^| pdff1.32 MB) Additional Information: full citation , abstract 

Reinforcement learning aims to determine an optimal control policy from interaction with a 
system or from observations gathered from a system. In batch mode, it can be achieved 
by approximating the so-called Q-function based on a set of four-tuples (x t , u t , r t , x t+1 ) 

where x t denotes the system state at time t, u t the control action taken, r t the 

instantane ... 

22 Tree-Based Batch Mode Reinforcement Learning Q 
Damien Ernst, Pierre Geurts, Louis Wehenkel 

September 2005 The Journal of Machine Learning Research, volume 6 
Publisher: MIT Press 

Full text available: ^|pdf(1.41 MB) Additional Information: full citation , abstract , citings 

Reinforcement learning aims to determine an optimal control policy from interaction with a 
system or from observations gathered from a system. In batch mode, it can be achieved 
by approximating the so-called Q-function based on a set of four-tuples (x t , u t , r v x t+1 ) 

where x t denotes the system state at time t, u t the control action taken, r t the 

instantane ... 

23 Book review: An Introduction to Neural and Electronic Networks. Edited by Steven E Q 
^ Zornetzer. Joel L. Davis, and Clifford Lau (Academic Press) 

^ James A. Reggia 

October 1990 ACM SIGART Bulletin, volume l issue 3 

Publisher: ACM Press 

Full text available: * g] pdf(245.53 KB) Additional Information: full citation , abstract 

The volume is a collection of papers on neural networks (real and artificial) edited by staff 
members of the Office of Naval Research (ONR). ONR has been a major supporter of 
research in this area for many years and the editors have directly overseen some of the 
work described in this book, making them especially qualified to organize this collection. 
The chapters themselves are authored by talented individuals in neuroscience, 
engineering, physics, psychology, computer science and mathematics, ... 

24 Learning bias and phonological-rule induction Q 
Daniel Gildea, Daniel Jurafsky 
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December 1996 Computational Linguistics, Volume 22 issue 4 
Publisher: MIT Press 
Full text available 



^pdf(2,25MB)gjl! Additional Information: full citation, abstract , references , citings 
Publisher Site 

A fundamental debate in the machine learning of language has been the role of prior 
knowledge in the learning process. Purely nativist approaches, such as the Principles and 
Parameters model, build parameterized linguistic generalizations directly into the learning 
system. Purely empirical approaches use a general, domain-independent learning rule 
(Error Back-Propagation, Instance-based Generalization, Minimum Description Length) to 
learn linguistic generalizations directly from the data. In this ... 

25 Round robin classification 
Johannes Furnkranz 

March 2002 The Journal of Machine Learning Research, Volume 2 
Publisher: MIT Press 

Full text available: fB pdf(250.25 KB) Additional lnformation ' citation , references, citings, index 

"™ terms 

In this paper, we discuss round robin classification (aka pairwise classification), a 
technique for handling multi-class problems with binary classifiers by learning one 
classifier for each pair of classes. We present an empirical evaluation of the method, 
implemented as a wrapper around the Ripper rule learning algorithm, on 20 multi-class 
datasets from the UCI database repository. Our results show that the technique is very 
likely to improve Ripper's classification accuracy without having a hi ... 

Keywords: class binarization, ensemble techniques, inductive rule learning, multi-class 
problems, pairwise classification 



26 Special Issue on learning theory: Path kernels and multiplicative updates Q 
Eiji Takimoto, Manfred K. Warmuth 

December 2003 The Journal of Machine Learning Research, Volume 4 
Publisher: MIT Press 

Full text available: g pdf(353.20 KB) Additional Information: full citation , abstract , index terms 

Kernels are typically applied to linear algorithms whose weight vector is a linear 
combination of the feature vectors of the examples. On-line versions of these algorithms 
are sometimes called "additive updates" because they add a multiple of the last feature 
vector to the current weight vector.In this paper we have found a way to use special 
convolution kernels to efficiently implement "multiplicative" updates. The kernels are 
defined by a directed graph. Each edge contributes an input. The inp ... 

27 Artificial immune systems: RABNET: a real-valued antibody network for data Q 
^ clustering 

^ Helder Knidel, Leandro N. de Castro, Fernando J. Von Zuben 

June 2005 Proceedings of the 2005 conference on Genetic and evolutionary 

computation GECCO '05 
Publisher: ACM Press 

Full text available: ^pdf(201.49 KB) Additional Information: full citation , abstract , references , index terms 

This paper proposes a novel constructive learning algorithm for a competitive neural 
network. The proposed algorithm is developed by taking ideas from the immune system 
and demonstrates robustness in the initial experiments reported here for a benchmark 
problem. Comparisons with results from the literature are also provided. To automatically 
segment the resultant neurons at the output, a tool from graph theory was used with 
promising results. General discussions and avenues for future works are ... 

Keywords: artificial immune systems, artificial neural networks, data clustering 



http://portal.acm.org/resultsxfm?query=%22neural%22%20 6/23/2006 



Results (page 2): "neural" and "prune" and "zero input" 



Page 3 of 6 



28 Integrating POMDP and reinforcement learning for a two laver simulated robot 
architecture 

Larry D. Pyeatt, Adele E. Howe 

April 1999 Proceedings of the third annual conference on Autonomous Agents 

Publisher: ACM Press 

Full text available: ^ pdf(853.34 KB) Additional Information: full citation , references , citings , index terms 




29 Inducing history representations for broad coverage statistical parsing Q 
James Henderson 

May 2003 Proceedings of the 2003 Conference of the North American Chapter of the 
Association for Computational Linguistics on Human Language Technology 
- Volume 1 NAACL v 03 

Publisher: Association for Computational Linguistics 

Full text available: ^pdf(1 29.57 KB) Additional Information: full citation , abstract , references 

We present a neural network method for inducing representations of parse histories and 
using these history representations to estimate the probabilities needed by a statistical 
left-corner parser. The resulting statistical parser achieves performance (89.1% F- 
measure) on the Penn Treebank which is only 0.6% below the best current parser for this 
task, despite using a smaller vocabulary size and less prior linguistic knowledge. Crucial to 
this success is the use of structurally determined soft bi ... 

30 Using a mixture of probabilistic decision trees for direct prediction of protein function Q 
^ Umar Syed, Golan Yona 

y*. April 2003 Proceedings of the seventh annual international conference on Research 
in computational molecular biology RECOMB '03 

Publisher: ACM Press 

Full text available: pdf(306.22 KB) Additional Information: full citation , abstract , references , index terms 

We study the direct relationship between basic protein properties and their function. Our 
goal is to develop a new tool for functional prediction that can be used to complement and 
support other techniques based on sequence or structure information. In order to define 
this new measure of similarity between proteins we collected a set of 453 features and 
properties that characterize proteins and are believed to be correlated and related to 
structural and functional aspects of proteins. Among thes ... 

Keywords: decision trees, functional prediction, sequence-function relationships 



31 Research track: Mining concept-drifting data streams using ensemble classifiers 
^ Haixun Wang, Wei Fan, Philip S. Yu, Jiawei Han 

August 2003 Proceedings of the ninth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Publisher: ACM Press 

Full text available: fSl P dft234.13 KB) Additional Information: full citation , abstract, references , citings, index 
^ terms 

Recently, mining data streams with concept drifts for actionable insights has become an 
important and challenging task for a wide range of applications including credit card fraud 
protection, target marketing, network intrusion detection, etc. Conventional knowledge 
discovery tools are facing two challenges, the overwhelming volume of the streaming data, 
and the concept drifts. In this paper, we propose a general framework for mining concept- 
drifting data streams using weighted ensemble classifi ... 

Keywords: classifier, classifier ensemble, concept drift, data streams 
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Special issue on special feature: Mips (mono layer polynomials and multi layer Q 
perceptions) for nonlinear modeling 
Isabelle Rivals, Leon Personnaz 

March 2003 The Journal of Machine Learning Research, volume 3 
Publisher: MIT Press 

Full text available: ^ pdf(247.17 KB) Additional Information: full citation , abstract, citings , index terms 

This paper presents a model selection procedure which stresses the importance of the 
classic polynomial models as tools for evaluating the complexity of a given modeling 
problem, and for removing non-significant input variables. If the complexity of the 
problem makes a neural network necessary, the selection among neural candidates can be 
performed in two phases. In an additive phase, the most important one, candidate neural 
networks with an increasing number of hidden neurons are trained. The a ... 

33 Research sessions: stream management: Online event-driven subsequence Q 

matching over financial data streams 
^ Huanmei Wu, Betty Salzberg, Donghui Zhang 

June 2004 Proceedings of the 2004 ACM SIGMOD international conference on 

Management of data 
Publisher: ACM Press 

Full text available: ^| pdf(753.59 KB) Additional Information: full citation , abstract , references 

Subsequence similarity matching in time series databases is an important research area 
for many applications. This paper presents a new approximate approach for automatic 
online subsequence similarity matching over massive data streams. With a simultaneous 
on-line segmentation and pruning algorithm over the incoming stream, the resulting 
piecewise linear representation of the data stream features high sensitivity and accuracy. 
The similarity definition is based on a permutation followed by a met ... 

34 Knowledge discovery using neural networks Q 
Khosrow Kaikhah, Sandesh Doddameti 

May 2004 Proceedings of the 17th international conference on Innovations in 
applied artificial intelligence IEA/AIE'2004 

Publisher: Springer Springer Verlag Inc 
Additional Information: full citation , abstract , index terms 

A novel knowledge discovery technique using neural networks is presented. A neural 
network is trained to learn the correlations and relationships that exist in a dataset. The 
neural network is then pruned and modified to generalize the correlations and 
relationships. Finally, the neural network is used as a tool to discover all existing hidden 
trends in four different types of crimes in US cities as well as to predict trends based on 
existing knowledge inherent in the network. 

35 A large project for demonstrating knowledge engineering techniques including Q 
applications of neural networks 
Mary Micco, Phillip Cumpston 

February 1990 ACM SIGCSE Bulletin , Proceedings of the twenty-first SIGCSE 

technical symposium on Computer science education SIGCSE '90, 

Volume 22 Issue 1 
Publisher: ACM Press 

Full text available: ^ pdf(641.81 KB) Additional Information: full citation , references , index terms 



36 Paper session KM-3 (knowledge management): classification & clustering: Collective Q 
multi-label classification 
Nadia Ghamrawi, Andrew McCallum 

October 2005 Proceedings of the 14th ACM international conference on Information 
and knowledge management CIKM '05 

Publisher: ACM Press 
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Full text available: ||| pdfl93.06 KB) Additional Information: full citation , abstract , references, index terms 

Common approaches to multi-label classification learn independent classifiers for each 
category, and employ ranking or thresholding schemes for classification. Because they do 
not exploit dependencies between labels, such techniques are only well-suited to problems 
in which categories are independent. However, in many domains labels are highly 
interdependent. This paper explores multi-label conditional random field (CRF)classification 
models that directly parameterize label co-occurrences in mul ... 

Keywords: classification, machine learning, multi-label, statistical learning, uncertainty 



37 Parameter space exploration with Gaussian process trees 
Ay Robert B. Gramacy, Herbert K. H. Lee, William G. Macready 

July 2004 Proceedings of the twenty-first international conference on Machine 

learning ICML '04 
Publisher: ACM Press 

Full text available: pdf(275.38 KB) Additional Information: full citation , abstract , references 

Computer experiments often require dense sweeps over input parameters to obtain a 
qualitative understanding of their response. Such sweeps can be prohibitively expensive, 
and are unnecessary in regions where the response is easy predicted; well-chosen designs 
could allow a mapping of the response with far fewer simulation runs. Thus, there is a 
need for computationally inexpensive surrogate models and an accompanying method for 
selecting small designs. We explore a general methodology for addres ... 

38 Essential classification rule sets 
^ Elena Baralis, Silvia Chiusano 

December 2004 ACM Transactions on Database Systems (TODS), Volume 29 issue 4 

Publisher: ACM Press 

Full text available: *^ pdf(479.09 KB) Additional Information: full citation, abstract , references , index terms 

Given a class model built from a dataset including labeled data, classification assigns a 
new data object to the appropriate class. In associative classification the class model (i.e., 
the classifier) is a set of association rules. Associative classification is a promising 
technique for the generation of highly accurate classifiers. In this article, we present a 
compact form which encodes without information loss the classification knowledge 
available in a classification rule set. This form incl ... 

Keywords: Association rules, associative classification, concise representations 



39 Selective Markov models for predicting Web page accesses 
^ Mukund Deshpande, George Karypis 

^ May 2004 ACM Transactions on Internet Technology (TOIT), Volume 4 issue 2 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 



Full text available: ' r _ _ 

m terms 

The problem of predicting a user's behavior on a Web site has gained importance due to 
the rapid growth of the World Wide Web and the need to personalize and influence a user's 
browsing experience. Markov models and their variations have been found to be well 
suited for addressing this problem. Of the different variations of Markov models, it is 
generally found that higher-order Markov models display high predictive accuracies on 
Web sessions that they can predict. However, higher-order models a ... 

Keywords: Markov models, World wide web, predicting user behavior, web mining 
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Douglas A. Talbert, Doug Fisher 

August 2000 Proceedings of the sixth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Publisher: ACM Press 

Full text available: ^ pdf(108.72 KB) Additional Information: full citation, references , index terms 
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41 Adaptive multilingual sentence boundary disambiguation 
David D. Palmer, Marti A. Hearst 

June 1997 Computational Linguistics, volume 23 issue 2 
Publisher: MIT Press 

Full text available. ^ 77 (|| Add j tjona j | n f orma tj on: full citation , abstract , references , citings 
Publisher Site 

The sentence is a standard textual unit in natual language processing applications. In 
many language the punctuation mark that indicates the end-of-sentence boundary is 
ambiguous; thus the tokenizers of most NLP systems must be equipped with special 
sentence boundary recognition rules for every new text collection.As an alternative, this 
article presents an efficient, trainable system for sentence boundary disambiguation. The 
system, called Satz, makes simple estimates of the parts of speech of ... 

42 Efficient search for association rules 

# Geoffrey I. Webb 
August 2000 Proceedings of the sixth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Publisher: ACM Press 

Full text available: | ||pdf(1 86.48 KB) Additional Information: full citation , references , citings , index terms 



Keywords: association rule, search 



43 The subspace information criterion for infinite dimensional hypothesis spaces 
Masashi Sugiyama, Klaus-Robert Muller 

March 2003 The Journal of Machine Learning Research, volume 3 
Publisher: MIT Press 

Full text available: ^ pdf(532.09 KB) Additional lnformation: Mcitation, abstract, references , citings, index 

terms 

A central problem in learning is selection of an appropriate model. This is typically done by 
estimating the unknown generalization errors of a set of models to be selected from and 
then choosing the model with minimal generalization error estimate. In this article, we 
discuss the problem of model selection and generalization error estimation in the context 
of kernel regression models, e.g., kernel ridge regression, kernel subset regression or 
Gaussian process regression. Previously, a no ... 

Keywords: Gaussian processes, cross-validation, finite sample statistics, generalization 
error, kernel regression, model selection, reproducing kernel Hilbert space, subspace 
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44 Incremental learning of linear model trees Q 
Duncan Potts 

July 2004 Proceedings of the twenty-first international conference on Machine 
learning ICML '04 

Publisher: ACM Press 

Full text available: ^pdf(184.10 KB) Additional Information: full citation , abstract, references 

A linear model tree is a decision tree with a linear functional model in each leaf. Previous 
model tree induction algorithms have operated on the entire training set, however there 
are many situations when an incremental learner is advantageous. In this paper we 
demonstrate that model trees can be induced incrementally using an algorithm that scales 
linearly with the number of examples. An incremental node splitting rule is presented, 
together with incremental methods for stopping the growth of ... 

45 Subspace clustering for high dimensional data: a review Q 
^ Lance Parsons, Ehtesham Haque, Huan Liu 

^ June 2004 ACM SIGKDD Explorations Newsletter, volume 6 issue l 
Publisher: ACM Press 

Full text available: |g| pdf(539.13 KB) Additional Information: full citation, abstract , references 

Subspace clustering is an extension of traditional clustering that seeks to find clusters in 
different subspaces within a dataset. Often in high dimensional data, many dimensions are 
irrelevant and can mask existing clusters in noisy data. Feature selection removes 
irrelevant and redundant dimensions by analyzing the entire dataset. Subspace clustering 
algorithms localize the search for relevant dimensions allowing them to find clusters that 
exist in multiple, possibly overlapping subspaces. The ... 

Keywords: clustering survey, high dimensional data, projected clustering, subspace 
clustering 




46 Evidence-based static branch prediction using machine learning Q 
jgk Brad Calder, Dirk Grunwald, Michael Jones, Donald Lindsay, James Martin, Michael Mozer, 
Benjamin Zorn 

January 1997 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 19 Issue 1 
Publisher: ACM Press 

r- II* ^ i ui 01 ^/c^ccnuDN Additional Information: full citation , abstract , references , citings , index 

Full text available: TE 3pdf(515.50 KB) ; 

terms 

Correctly predicting the direction that branches will take is increasingly important in 
today's wide-issue computer architectures. The name program-based branch prediction is 
given to static branch prediction techniques that base their prediction on a program's 
structure. In this article, we investigate a new approach to program-based branch 
prediction that uses a body of existing programs to predict the branch behavior in a new 
program. We call this approach to program-ba ... 

Keywords: branch prediction, decision trees, machine learning, neural networks, 
performance evaluation, program optimization 



47 Session: A fast partial parse of natural language sentences using a connectionist 
method 

Caroline Lyon, Bob Dickerson 

March 1995 Proceedings of the seventh conference on European chapter of the 

Association for Computational Linguistics 
Publisher: Morgan Kaufmann Publishers Inc. 
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The pattern matching capabilities of neural networks can be used to locate syntactic 
constituents of natural language. This paper describes a fully automated hybrid system, 
using neural nets operating within a grammatic framework. It addresses the 
representation of language for connectionist processing, and describes methods of 
constraining the problem size. The function of the network is briefly explained, and results 
are given. 

48 Classifying text documents by associating terms with text categories 
Osmar R. Zai'ane, Maria-Luiza Antonie 

January 2002 Australian Computer Science Communications , Proceedings of the 

thirteenth Australasian conference on Database technologies - Volume 
5 CRPITS '02, Volume 24 Issue 2 

Publisher: Australian Computer Society, Inc. , IEEE Computer Society Press 

i- ii * •. u. a Aim cxa * a n\ Additional Information: full citation , abstract, references , citings , index 
Full text available: 1Sq pdf(L04 MB) 

Automatic text categorization has always been an important application and research topic 
since the inception of digital documents. Today, text categorization is a necessity due to 
the very large amount of text documents that we have to deal with daily. Many techniques 
and algorithms for automatic text categorization have been devised and proposed in the 
literature. However, there is still much room for improving the effectiveness of these 
classifiers, and new models need to be examined. We prop ... 

Keywords: association rules, classification, text categorization, text mining 



49 Learning and making decisions when costs and probabilities are both unknown ^ 
Bianca Zadrozny, Charles Elkan 

August 2001 Proceedings of the seventh ACM SIGKDD international conference on 

Knowledge discovery and data mining 
Publisher: ACM Press 

Full text available: ■ pdft920.23 KB) Additional Information: full citation , abstract, references , citings, index 

terms 

In many data mining domains, misclassification costs are different for different examples, 
in the same way that class membership probabilities are example-dependent. In these 
domains, both costs and probabilities are unknown for test examples, so both cost 
estimators and probability estimators must be learned. After discussing how to make 
optimal decisions given cost and probability estimates, we present decision tree and naive 
Bayesian learning methods for obtaining well-calibrated probability ... 

50 A review of vessel extraction techniques and algorithms Q 
Cemil Kirbas, Francis Quek 

June 2004 ACM Computing Surveys (CSUR), Volume 36 issue 2 
Publisher: ACM Press 

Full text available: |p pdf(8:06 MB) Additional Information: full citation , abstract , references , index terms 

Vessel segmentation algorithms are the critical components of circulatory blood vessel 
analysis systems. We present a survey of vessel extraction techniques and algorithms. We 
put the various vessel extraction approaches and techniques in perspective by means of a 
classification of the existing research. While we have mainly targeted the extraction of 
blood vessels, neurosvascular structure in particular, we have also reviewed some of the 
segmentation methods for the tubular objects that show ... 

Keywords: Magnetic resonance angiography, X-ray angiography, medical imaging, 
neurovascular, vessel extraction 
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51 A novel ensemble-based scoring and search algorithm for protein redesign, and its Q 
application to modify the substrate specificity of the gramicidin synthetase A 
phenylalanine adenylation enzyme 

Ryan H. Lilien, Brian W. Stevens, Amy C. Anderson, Bruce R. Donald 

March 2004 Proceedings of the eighth annual international conference on Resaerch in 
computational molecular biology RECOMB '04 

Publisher: ACM Press 

Full text available: ^pdf(2.36 MB) Additional Information: full citation, abstract, references , index terms 

Realization of novel molecular function requires the ability to alter molecular complex 
formation. Enzymatic function can be altered by changing enzyme-substrate interactions 
via modification of an enzyme's active site. A redesigned enzyme may either perform a 
novel reaction on its native substrates or its native reaction on novel substrates. A number 
of computational approaches have been developed to address the combinatorial nature of 
the protein redesign problem. These approaches typically se ... 

Keywords: enzyme design, fluorescence binding assay, molecular ensemble, non- 
ribosomal peptide synthetase, protein design, protein flexibility, protein-ligand binding 




52 Statistics and data mining techniques for lifetime value modeling Q 
D. R. Mani, James Drew, Andrew Betz, Piew Datta 

August 1999 Proceedings of the fifth ACM SIGKDD international conference on 
Knowledge discovery and data mining 

Publisher: ACM Press 

Full text available: | |pdf(1.16 MB) Additional Information: full citation , references , citings , index terms 




Keywords: lifetime value, neural networks, proportional hazards regression, survival 
analysis, tenure prediction 



53 Research papers: spatial and high-dimensional data: Robust and fast similarity Q 
search for moving object trajectories 
Lei Chen, M. Tamer Ozsu, Vincent Oria 

June 2005 Proceedings of the 2005 ACM SIGMOD international conference on 
Management of data 

Publisher: ACM Press 

Full text available: 'fH pdf(811.42 KB) Additional Information: full citation , abstract , references 

An important consideration in similarity-based retrieval of moving object trajectories is the 
definition of a distance function. The existing distance functions are usually sensitive to 
noise, shifts and scaling of data that commonly occur due to sensor failures, errors in 
detection techniques, disturbance signals, and different sampling rates. Cleaning data to 
eliminate these is not always possible. In this paper, we introduce a novel distance 
function, Edit Distance on Real sequence (EDR) whic ... 

54 Training connectionist models for the structured language model Q 
Peng Xu, Ahmad Emami, Frederick Jelinek 

July 2003 Proceedings of the 2003 conference on Empirical methods in natural 
language processing - Volume 10 

Publisher: Association for Computational Linguistics 

Full text available: ^ pdf(147.24 KB) Additional Information: full citation , abstract , references 

We investigate the performance of the Structured Language Model (SLM) in terms of 
perplexity (PPL) when its components are modeled by connectionist models. The 
connectionist models use a distributed representation of the items in the history and make 
much better use of contexts than currently used interpolated or back-off models, not only 
because of the inherent capability of the connectionist model in fighting the data 
sparseness problem, but also because of the sublinear growth in the model si ... 
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55 A survey on wavelet applications in data mining ^ 
Tao Li, Qi Li, Shenghuo Zhu, Mitsunori Ogihara 

December 2002 ACM SIGKDD Explorations Newsletter Volume 4 issue 2 
Publisher: ACM Press 

Full text available: ^ pdf(330.06 KB) Additional Information: full citation , abstract , references , citings 

Recently there has been significant development in the use of wavelet methods in various 
data mining processes. However, there has been written no comprehensive survey 
available on the topic. The goal of this is paper to fill the void. First, the paper presents a 
high-level data-mining framework that reduces the overall process into smaller 
components. Then applications of wavelets for each component are reviewd. The paper 
concludes by discussing the impact of wavelets on data mining research an ... 

56 Evolving and messaging decision-making agents Q 

# Edmund S. Yu 
May 2001 Proceedings of the fifth international conference on Autonomous agents 

Publisher: ACM Press 

Full text available: ^ pdf(240.53 KB) Additional Information: full citation , abstract , references , index terms 

In this paper we describe our neurogenetic approach to developing a multi- agent decision 
support system which assists users in gathering, merging, analyzing, and using 
information to assess risks and make recommendations in situations that may require 
tremendous amounts of time and attention of the users. In Phase I of this project, called 
the EMMA project, we demonstrated the feasibility of a set of solutions to various problems 
by building an intelligent agent application that makes reco ... 

Keywords: adaptation and learning, agent communication languages, evolution of agents, 
information agents, multi-agent communication/collaboration 




57 Research track paper: A multiple tree algorithm for the efficient association of 
asteroid observations 

Jeremy Kubica, Andrew Moore, Andrew Connolly, Robert Jedicke 

August 2005 Proceeding of the eleventh ACM SIGKDD international conference on 
Knowledge discovery in data mining KDD '05 

Publisher: ACM Press 

Full text available: ^ pdf(398.50 KB) Additional Information: full citation , abstract , references , index terms 

In this paper we examine the problem of efficiently finding sets of observations that 
conform to a given underlying motion model. While this problem is often phrased as a 
tracking problem, where it is called track initiation, it is useful in a variety of tasks where 
we want to find correspondences or patterns in spatial-temporal data. Unfortunately, this 
problem often suffers from a combinatorial explosion in the number of potential sets that 
must be evaluated. We consider the problem with respe ... 

Keywords: multiple tree algorithms, track initiation 




58 Combining multi-visual features for efficient indexing in a large image database Q 
Anne H. H. Ngu, Quan Z. Sheng, Du Q. Huynh, Ron Lei 

April 2001 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 9 Issue 4 
Publisher: Springer-Verlag New York, Inc. 

Full text available: ^ pdf(493.09 KB) Additional Information: full citation , abstract , citings, index terms 

The optimized distance-based access methods currently available for multidimensional 
indexing in multimedia databases have been developed based on two major assumptions: 
a suitable distance function is known a priori and the dimensionality of the image features 
is low. It is not trivial to define a distance function that best mimics human visual 
perception regarding image similarity measurements. Reducing high-dimensional features 
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59 Aggregate nearest neighbor queries in spatial databases Q 
Dimitris Papadias, Yufei Tao, Kyriakos Mouratidis, Chun Kit Hui 
June 2005 ACM Transactions on Database Systems (TODS), Volume 30 issue 2 

Publisher: ACM Press 

Full text available: ^pdf(3.84 MB) Additional Information: full citation , abstract , references , index terms 

Given two spatial datasets P (e.g., facilities) and Q (queries), an aggregate nearest 
neighbor (ANN) query retrieves the point(s) of P with the smallest aggregate distance(s) to 
points in Q. Assuming, for example, n users at locations ql,.,.qn, an ANN query outputs 
the facility p e P that minimizes the sum of distances &verbar;pgi&verbar; for 1 s£ / ... 

Keywords: Spatial database, aggregation, nearest neighbor queries 




60 Machine learning in automated text categorization 
Fabrizio Sebastiani 

March 2002 ACM Computing Surveys (CSUR), volume 34 issue 1 
Publisher: ACM Press 

Additional Information: full citation , abstract , references, citings , index 



Full text available: HW5 = ^ ^ 

^ terms 

The automated categorization (or classification) of texts into predefined categories has 
witnessed a booming interest in the last 10 years, due to the increased availability of 
documents in digital form and the ensuing need to organize them. In the research 
community the dominant approach to this problem is based on machine learning 
techniques: a general inductive process automatically builds a classifier by learning, from a 
set of preclassified documents, the characteristics of the categories. ... 

Keywords: Machine learning, text categorization, text classification 



Results 41 - 60 of 200 Result page: previous 12345678910 next 

The ACM Portal is published by the Association for Computing Machinery. Copyright © 2006 ACM, Inc. 
Terms of Usage Privacy Policy Code of Ethics Contact Us 



Useful downloads: ^ Adobe Acrobat QuickTime HI Windows Media Plaver ^ Real Player 



http://portal.acm.org/resultsxfm?query=%22neural%22%20and%20%22prune%22%20an... 6/23/2006 



+"neural" +"prune" +"zero input" - Google Search 



Page 1 of 2 
Sign in 



Google \ 



Web Images Groups News Froogle Maps more » 





Advanced Search 
Preferences 



Web 



Results 1 - 10 of about 27 for +" neural " + ,, prune" -'■"zero input ". (0.08 seconds) 



rPDn NEURAL NETWORKS: Basics using MATLAB Neural Network Toolbox By ... 
File Format: PDF/Adobe Acrobat - View as HTML 
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